Abstract 

Sofic groups were denned implicitly by Gromov in [Gr99] and explicitly by Weiss 
in [WeOO]. All residually finite groups (and hence all linear groups) are sofic. The 
purpose of this paper is to introduce, for every countable sofic group G, a family of 
measure-conjugacy invariants for measure-preserving G- actions on probability spaces. 
These invariants generalize Kolmogorov-Sinai entropy for actions of amenable groups. 
They are computed exactly for Bernoulli shifts over G, leading to a complete classi- 
fication of Bernoulli systems up to measure-conjugacy for many groups including all 
countable linear groups. Recent rigidity results of Y. Kida and S. Popa are utilized to 
classify Bernoulli shifts over mapping class groups and property (T) groups up to orbit 
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1 Introduction 

This paper is motivated by an old and central problem in measurable dynamics: given two 
dynamical systems, determine whether or not they are measurably-conjugate, i.e., isomor- 
phic. Let us set some notation. 

A dynamical system (or system for short) is a triple (G, X, //) where (X, //) is a prob- 
ability space and G is a group acting by measure-preserving transformations on (X,fi). 
We will also call this a dynamical system over G, a G-system or an action of G. 
In this paper, G will always be a discrete countable group. Two systems (G, X, ji) and 
(G,Y,u) are isomorphic (i.e., measurably conjugate) if and only if there exist conull 
sets X' C X, Y' C Y and a bijective measurable map : X' — > Y' such that <fi~ l :Y'—^X' 
is measurable, 0*/i = v and 4>{gx) = g(j)(x)\/g G G, x G X' . 

A special class of dynamical systems called Bernoulli systems or Bernoulli shifts 
has played a significant role in the development of the theory as a whole because it was 
the problem of trying to classify them that motivated Kolmogorov to introduce the mean 
entropy of a dynamical system over Z [Ko58, Ko59]. That is, Kolmogorov defined for every 
system (Z, X, fi) a number /i(Z, X, fi) called the mean entropy of (Z, X, //) that quantifies, 
in some sense, how "random" the system is. His definition was modified by Sinai [Si59]; the 
latter has become standard. 

Bernoulli shifts also play an important role in this paper, so let us define them. Let (K, k) 
be a standard Borel probability space. For a discrete countable group G, let K G = Ylqec ^ 
be the set of all functions x : G — > K with the product Borel structure and let k g be the 
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product measure on K G . The group G acts on K G by (gx)(f) = x{g~ x f) for x G K G and 
g, f G G. This action is measure-preserving. The system (G, K G , k g ) is the Bernoulli shift 
over G with base (K, k). It is nontrivial if k is not supported on a single point. 

Before Kolmogorov's seminal work [Ko58, Ko59], it was unknown whether all non- 
trivial Bernoulli shifts over Z were measurably conjugate to each other. He proved that 
/i(Z, K z , k, z ) = H(k) where H(n), the entropy of k is defined as follows. If there exists a 
finite or countably infinite set K' C K such that k(K') = 1 then 

ff(K) = -£>({*}) kg (/*({*})) 

keK' 

where we follow the convention 01og(0) = 0. Otherwise, H(k) = +00. Thus two Bernoulli 
shifts over Z with different base measure entropies cannot be measurably conjugate. 

The converse was proven by D. Ornstein in the groundbreaking papers [Or70a, Or70b]. 
That is, he proved that if two Bernoulli shifts (Z, K z , k z ), (Z, L z , X z ) are such that H(k) = 
H(X) then they are isomorphic. 

In [Ki75] , Kieffer proved a Shannon-McMillan theorem for actions of a countable amenable 
group G. In particular, he extended the definition of mean entropy from Z-systems to G- 
systems. It is then not difficult to show from that Kolmogorov's theorem extends to Bernoulli 
shifts over G. 

In the landmark paper [OW87], Ornstein and Weiss extended most of the classical entropy 
theory from Z-systems to G-systems where G is any countable amenable group. (This paper 
also contains many results for nondiscrete amenable groups). In particular, they proved that 
if two Bernoulli shifts (G, K G , k g ), (G, L g , X g ) over a countably infinite amenable group G 
are such that H(k) = H(X) then they are isomorphic. Thus Bernoulli shifts over G are 
completely classified by base measure entropy. 

Let us say that a group G is Ornstein if H(k) = H(X) implies (G, K G , k g ) is isomorphic 
to (G, L G , X G ) where (K, k) and (L, X) are any two standard Borel probability spaces. By the 
above, all countably infinite amenable groups are Ornstein. Stepin proved that any countable 
group that contains an Ornstein subgroup is itself Ornstein [St 75]. This paper is not widely 
available; but a proof is also supplied in [Bo08b]. It is apparently unknown whether or not 
every countably infinite group is Ornstein. But an open case is that of Ol'shanskii's monsters 
[0191]. 

At the end of [OW87], Ornstein and Weiss presented a curious example suggesting that 
there might not be a reasonable entropy theory for nonamenable groups. It pertains to a 
well-known fundamental property of entropy: it is nonincreasing under factor maps. To 
explain, let (G, X, fi) and (G, Y, v) be two systems. A map : X — > Y is a factor if 0*/i = v 
and <j>(gx) = g<f>(x) for a.e. x G X and every g G G. If G is amenable then the mean entropy 
of a factor is less than or equal to the mean entropy of the source. This is essentially due to 
Sinai [Si59]. So if K n = {1, . . . ,n} and K n is the uniform probability measure on K n then 
(G,K g ,k g ), which has entropy log(2), cannot factor onto (G,K G , k g ), which has entropy 
log(4). 

The argument above fails if G is nonamenable. Indeed, let G = (a, b) be a rank 2 free 



group. Identify K 2 with the group Z/2Z and K 4 with Z/2Z x Z/2Z. Then 

<f>( x )(9) '■= i x (g) + x(ga),x(g) + x(gb)) V16 K?,g e G 

is a factor map from (G, K 2 , k, 2 ) onto (G, Kf, K,f). This is Ornstein- Weiss' example. 

There is an obvious factor map from (G, Kf, nf) onto (G, K^ , k 2 ) (for any group G), so 
the authors speculated that if G = (a, b) then these two Bernoulli shifts might be measurably 
conjugate. We now know that this is false. The paper [Bo08a] introduced an invariant for 
dynamical systems over a free group that behaves similarly to Kolmogorov-Sinai entropy. In 
particular, it shows that Bernoulli shifts over a free group are completely classified by base 
measure entropy. 

The purpose of this paper is define, for every countable sofic group G, a family of iso- 
morphism invariants that enables us to completely classify Bernoulli shifts over a countable 
sofic Ornstein group. It is expected that, as with Kolmogorov-Sinai entropy, these invariants 
will have an impact broader than this initial application to Bernoulli shifts. So what is a 
sofic group? Before stating the definition, let us note a few facts. These groups were defined 
implicitly by Gromov in [Gr99] and explicitly by Weiss in [WeOO]. An almost immediate 
consequence of the definition is that all residually amenable groups are sofic. In particular, 
since linear groups (i.e., subgroups of GL n (F) where F is a field) are residually finite (by 
[Ma40]) they are sofic. It is unknown whether every countable group is sofic but an unre- 
solved case is that of the universal Burnside group on a finite set of generators. Pestov has 
written a beautiful up-to-date survey [Pe08] on sofic groups and their siblings, hyperlinear 
groups. 

Definition 1. Let G be a countable group. For m > 1, let Sym(m) denote the full sym- 
metric group on {1, . . . ,m}. Let o : G — ► Sym(m) be a map. a is not assumed to be a 
homomorphism! For F C G, let V(F) C {1, . . . , m} be the set of all elements v such that 
for all /i,/ 2 eF, 

<y{fi)<y{f 2 )v = a(fif 2 )v 

and cr(fi)v ^ cr(f 2 )v if f\ ^ f 2 . a is an (F, e)-approximation to G if | V(F)| > (1 — e)m. 

Let £ = {(Xi}^ be a sequence of maps Oi : G — > Sym(mj). Then E is a sofic approx- 
imation to G if each a, is an (F t , ^-approximation to G for some (i^, 6j) where F^ C F i+i 
for all i, Uj.Fj = G and q — ► as % — >■ oo. G is sofic if there exists a sofic approximation to 
G. 

Example 1. If G is residually finite then there exists a sequence {iVj} of finite-index normal 
subgroups of G with iVj + i < iVj for all % and DjiVj = {e}. Let &i : G — *■ Sym(G/A ? j) 
be the canonical homomorphism given by the action of G on G/N^. Then {o"j} is a sofic 
approximation to G. 

Example 2. If G is amenable then there exists an increasing sequence {Fi} of finite subsets 
of G such that (J^ Fj = G and for every finite K G G 

, I^AFJ 

lim — — = 1. 



Let Oi : G — > Sym(Fj) be any map such that if / G Fi, g G G and gf G F, then <?i(g)f = gf '■ 
Then {<Tj} is a sofic approximation to G. 

In section [21 we define the entropy of a system (G, X, /i) with respect to a sofic approxima- 
tion S. It is denoted /i(S, G, X, /i). The proof that this entropy is invariant under measure- 
conjugacy occupies sections 0] - CD In section [HI it is proven that /i(£, G, K G , k g ) = H(k) 
whenever H(k) < oo. This implies the next result. 

Theorem 1.1. Let G be a countable sofic group and let (Kx, Kx), (K 2 , K 2 ) be standard Borel 
probability spaces such that H(kx)+H(k 2 ) < oo. If (G, K G , k g ) is isomorphic to (G, K G , k g ) 
then H(kx) = H(k 2 ). 

In section [HJ it is shown that if G is also Ornstein then the finiteness condition in the 
above theorem can be removed. Thus: 

Corollary 1.2. Let G be a countable sofic Ornstein group. Let (Kx,Kx),(K 2 ,k 2 ) be stan- 
dard Borel probability spaces. Then (G, K G , k g ) is isomorphic to (G, K G , k g ) if and only if 
H(k 1 )=H(k 2 ). 

To make a contrast, recall that two systems (G, X, fi), (G, Y, v) are weakly isomorphic 
if (G, X, fi) is a factor of (G, Y, v) and (G, Y, v) is a factor of (G, X, fi). The next theorem is 
proven in [Bo08b]. 

Theorem 1.3. Let G be a countable group that contains a nonabelian free subgroup. Let 
(Kx, Kx), (K 2 , K2) be any two nontrivial standard Borel probability spaces. Then (G, K G , k g ) 
is weakly isomorphic to (G, K G , n G ). 

The main ingredient in the proof is Ornstein- Weiss' example. In section [HJ this is used 
to prove: 

Theorem 1.4. Let G be a countable sofic group that contains a nonabelian free subgroup. 
Let (K,k) be a standard Borel probability space with H(k) = +00. Then (G,K g ,k g ) does 
not have a finite- entropy generating partition. 

The conclusion to this theorem is well-known to hold if G is amenable. It is apparently 
unknown whether this result holds for all countable groups. 

Let us consider the special case in which G is a countably infinite linear group. Then 
every finitely generated subgroup of G is residually finite by [Ma40]. Hence, G is sofic. By 
the celebrated Tits alternative [Ti72] , any finitely generated subgroup of G is either virtually 
solvable (and hence amenable) or contains a nonabelian free group. Thus either G contains 
a nonabelian free subgroup or it is amenable. In either case, it is Ornstein. Therefore, the 
following is proven. 

Corollary 1.5. Let G be a countably infinite linear group. If (Kx, Kx), (K 2 , k 2 ) are standard 
Borel probability spaces then (G, K G , k g ) is isomorphic to (G, K G , k 2 ) if and only if 'H(kx) = 
H(k 2 ). G is nonamenable if and only if every two nontrivial Bernoulli shifts over G are 
weakly isomorphic. IfH(Kx) = +00 then there are no finite- entropy generating partitions for 
(G,K g ,k g ). 



1.1 Conjugation up to automorphisms 

There following definition is important in the applications that follow. 

Definition 2. Two systems (Gi,Xi,fj,i) and (G 2 ,X 2 ,/i 2 ) are conjugate up to automor- 
phisms if there exists an isomorphism $ : G\ — > G2 and a measure-space isomorphism 
<p : X[ — > X' 2 (where X[ is a conull subset of Xj) such that <p(gx) = <$>(g)(f)(x) for every 
g G Cri and a; G X{. 

For example, let (G, X, fi) be a system and let a : G — > G be an automorphism. Let 
(X a ,/x a ) be a copy of (X, //). Define an action of G on (X a ,/i a ) by g • x = a(g)x for 
g G G, x G X a . Then (G, X, /x) and (G, X a ,/i a ) are conjugate up to automorphisms. It is 
possible that they are not isomorphic as G-systems. 

Theorem 1.6. Let G be a countable sofic group and let (K±, k±), (K 2 , K 2 ) be standard Borel 
probability spaces such that H{k\) + H(k,2) < 00. If (G, Kf, nf) is conjugate up to automor- 
phisms to (G, K 2 , K2) then H{k\) = H(k 2 ). 



Corollary 1.7. Let G be a countable sofic Ornstein group. Let (X 1? Ki), (X 2 , k 2 ) be standard 

-g k g 



Borel probability spaces. Then (G, Kf , nf) is conjugate up to automorphisms to (G, K 2 , k^ 



if and only if H(k±) = H(k2). 

This theorem and its corollary follow from theorem I2.1[ proposition 12.21 and lemma 12.31 
below. 

1.2 Orbit equivalence and von Neumann equivalence 

The purpose of this subsection is to show that if the group G satisfies certain additional 
hypotheses then the results above can be used to classify Bernoulli shifts over G up to orbit 
equivalence and even up to von Neumann equivalence. To recall the definitions, for i — 1, 2 
let (Gj,Xj,/ij) be a system. 

Definition 3. (Gi,Xi,/ii) and (G2, X 2 , ^2) are orbit-equivalent (OE) if there exist conull 
sets X[ C Xi, X' 2 C X 2 and a measure-space isomorphism <fi : X[ — > X' 2 such that for all 
x G X[, </>(Gix) = G 2 (j)(x). 

Orbit equivalence was introduced implicitly in [Si55] and explicitly in [Dy59] where it was 
shown that all free ergodic actions of Z are orbit equivalent. This was extended to countable 
amenable groups in [OW80, CFW81]. In the last decade, a number of striking rigidity results 
in orbit equivalence theory have been proven. These imply that under special additional 
hypotheses, orbit equivalence implies conjugacy up to automorphisms. For example, S. 
Popa proved [corollary 1.3, P0O8] that if Gi,G 2 are two countably infinite groups, G\ is 
nonamenable and G± x G 2 has no nontrivial finite normal subgroups then any two Bernoulli 
shifts over G\ x G 2 are orbit equivalent if and only if they are conjugate up to automorphisms. 
Thus theorem 11.61 and corollary 11.71 imply 



Corollary 1.8. Let G\,G 2 be countably infinite groups and suppose G\ is nonamenable. 
Suppose that G := Gi x G 2 is sofic and has no nontrivial finite normal subgroups. Let 
(Kx, «i), (K 2 , Kq) be standard Borel probability spaces with H(ki)+H(k 2 ) < 00. If(G,K^,nf) 
is orbit- equivalent to (G,K 2 ,k 2 ) then H(ki) = H(k 2 ). 

If G is also Ornstein, then H(Ki) is allowed to be infinite and the converse also holds. 
That is: (G, K?, «f ) is OE to (G,K^ K f) if and only if Hfa) = H(k 2 ). 

Y. Kida proved [Ki08, theorem 1.4] that if G is the mapping class group of a genus g, 
n-holed surface S 9tn for some (g,n) with 3g + n — 4 > and (g, n) £ {(1,2), (2,0)}, then 
any two Bernoulli shifts over G are orbit equivalent if and only if they are conjugate up 
to automorphisms. By [Gr74, see also Iv86], mapping class groups are residually finite and 
hence, sofic. It is well-known that they contain infinite cyclic subgroups and hence, are 
Ornstein. So corollary 11.71 implies: 

Corollary 1.9. Let G be the mapping class group of a genus g, n-holed surface S gtTl for 
some (g, n) with 3g + n — 4 > and (g, n) <£ {(1, 2), (2, 0)}. Let (Ki, Ki), (K 2 , K 2 ) be standard 
Borel probability spaces. Then (G, Kf, nf) is orbit-equivalent to (G, K 2 , k 2 ) if and only if 
H(k 1 )=H(k 2 ). 

It is worth pointing out that Popa's and Kida's results are much more general than we 
have used here. 

Let us now turn our attention towards von Neumann equivalence. A system (G, X, u) 
gives rise in a natural way to a von Neumann algebra L°°(X,fi) x G called the group 
measure space or crossed product construction of Murray and von Neumann [MvN36]. 
If G is infinite and the action is free and ergodic then L°°(X, //) x G is a II\ factor, a highly 
noncommutative infinite-dimensional algebra with a positive trace. It is a fundamental 
problem in the theory of von Neumann algebras to classify type I Ji-factors up to isomorphism 
in terms of the group/action data. This motivates the next definition. 

Definition 4. Two systems (G\,Xi, fii) and (G 2 , X 2 , fi 2 ) are von Neumann equivalent 
(vNE) if L°°(Xi,jUi) x G x is isomorphic to L°°(X 2 ,fi 2 ) x G 2 . 

It was shown in [Si55] that orbit equivalence implies von Neumann equivalence, indeed 
this insightful discovery motivated the study of orbit equivalence. In [Co 76], A. Connes 
proved that all II± factors arising from actions of amenable groups are isomorphic. By 
contrast, nonamenable groups were used to produce large families of nonisomorphic factors 
in [MvN43], [Dy63], [Sc63], [Mc69], [Co75]. 

In a series of groundbreaking papers [P0O6-P0O8], S. Popa established a variety of vNE 
rigidity results. These posit that under certain additional hypotheses von Neumann equiv- 
alence implies conjugacy up to automorphisms. The survey [Po07] covers many of these 
developments. For example, in [P0O6, corollary 0.2], it is proven that if G is a countably 
infinite property (T) group such that every nontrivial conjugacy class is infinite (this is ab- 
breviated as ICC) then two Bernoulli shifts over G are von Neumann equivalent if and only 
if they are conjugate up to automorphisms. Thus theorem 11.61 and corollary 11.71 imply the 
following. 

7 



Corollary 1.10. Let G be a countably infinite ICC sofic property (T) group. Let (Ki,Ki), 
(K 2 , K 2 ) be standard Borel probability spaces with H(ki) + H(k 2 ) < 00. If (G, K±, nf) is von 
Neumann equivalent to (G,K 2 ,k 2 ) then H(ki) = H(k 2 ). 

If, in addition, G is Ornstein then H[k\) and H(k 2 ) are allowed to be infinite and the 
converse also holds. That is, L°°(Ki, k x ) x G is isomorphic to L°°(K 2 , K 2 ) X G if and only 
ifH( Kl ) = H( K2 ). 

For example, if G = PSL n (1i) for n > 2 then G is a countably infinite, ICC, sofic, 
property (T), Ornstein group. The above result is a special case of a more general theorem. 
To state it we will need some definitions that are not widely known. For fundamental results 
related to these definitions, the reader is referred to [P0O6]. 

Definition 5. A group G is iw-rigid if it contains an infinite normal subgroup with the 
relative property (T) of Kazhdan-Margulis (in other words, (G, H) is a property (T)-pair, 
see [Ma82], [dHv89]). For example, all infinite groups with property (T) are w-rigid. 

Definition 6. A subgroup H < G is wg-normal if for every intermediate subgroup H < 
H' < G with H' 7^ G, there exists an element g G G such that \gH'gC\H'\ = +00. wg-normal 
stands for "weakly- quasinormal" . 

Definition 7. A group G is in the class w% if it contains a subgroup H such that 

• (G, H) is a property (T) pair, 

• H is not virtually abelian and 

• H is wg-normal in G. 

Corollary 1.11. Let G be an ICC sofic group. Suppose one of the following conditions hold: 

1. G is w-rigid or in the class wT . 

2. There is a nonamenable subgroup H < G such that C(H), the centralizer of H , is 
wq-normal in G and is not virtually abelian. 

Let (Ki, Ki), (K 2 , K2) be two standard Borel probability spaces such that H{ki) + H(k 2 ) < 00. 
If (G, Ki , nf) is von Neumann equivalent to (G, K^ , k 2 ) then H(k\) = H{k 2 )- 

If, in addition, G is Ornstein (for example, if G is linear) then H{k\) and H(k 2 ) are 
allowed to be infinite and the converse also holds. That is, L°°(Ki,Ki) x G is isomorphic 
to L°°(K£,k%) x G if and only if H{k x ) = H(k 2 ). 

Proof. This follows from theorem II. 6[ corollary 11.71 [P0O6, corollary 0.2] (if condition (1.) 
holds), and [P0O8, theorem 1.5] (if condition (2.) holds). □ 

The following generalizes corollary 11.81 



Corollary 1.12. Let G be a sofic group satisfying: G has no nontrivial finite normal sub- 
groups, G contains infinite commuting subgroups H, H' with H nonamenable and H' < G 
is wq-normal. Let (K±, Ki), (K2, n 2 ) be two standard Borel probability spaces such that 
H(k\) + H(k 2 ) < 00. If(G,K^, nf) is orbit- equivalent to (G,K 2 , k 2 ) then H(k,i) = H(k 2 ). 
If G is also Ornstein then H(k,i) and H(k 2 ) are allowed to be infinite and the converse 
also holds. 

Proof. This follows from theorem 11.61 corollary 11.71 and [P0O8, corollary 1.3]. □ 

2 The invariants 

In this section, we define the new invariants and state the main theorem. So fix a count- 
able group G. In this paper, all partitions a = (A±, . . . ) of a probability space (X, //) are 
measurable and at most countable. 

If £ = {o~i} is a sofic approximation of G, (G, X, /1) is a system and a is a finite partition 
of X then the E-entropy rate of a is, roughly speaking, the exponential rate of growth of the 
number of partitions (3 on {1, . . . , m,} that approximate a. So we begin by making precise 
a notion of approximation for such partitions. 

Definition 8. Let (G, X, /1) be a system and a = {A\, A 2 , . . . ) an ordered partition of X. 
Let a : G — > Sym(m) be a map, ( be the uniform probability measure on {1, . . . , m} and 
(3 = (B\, B 2 , . . . ) be a partition of {1, ... , m}. 

Let F C G be finite. Given a function <p : F — > N, let A^ = H/gf f^-Hf) an< ^ B <t> = 
V\ S eF a U) B m- Define 

d F (a,(3) = J2 \K A <p)~(( B 4 

The above definitions make sense even if a = (A\, . . . , A u ) or (3 = (Bi, . . . , B v ) are finite: 
just set Ai = Bj = for i > u and j > v. 

If a = (Ax, . . . , A u ) is finite then for e > 0, let AV(a, a : F, e) be the set of all ordered 
partitions /3 = (i?i, . . . , _B U ) of {1, . . . , m} with the same number of atoms as a such that 
dp(oi, /3) < e. AV stands for approximating partitions. 

Definition 9. A map sequence for G is a sequence E = {0",}°^ of maps o~i : G —»■ Sym(mj) 
such that irti — > 00 as i — > 00. 

Definition 10. Let (G, X, //) be a system and £ := {ctj}^! a map sequence for G. For every 
finite partition a of X, e > and finite set F G G, let 

H(E,a:F, e) = limsup — log \AV(o~i, a : F, e)| 

i— >oo Tij 

if (E, a : F) = limif (E, a : F, e). 
/i(E,a) = inf H(E,a:F). 

FdG 



A few words about the above definitions are in order. If ei > e 2 then AV(a, a : F, ei) D 
AV(a,a : F, €2), so the limit defining if(E,a : F) exists and equals the infimum over all 
e>0. 

The infimum defining /i(£, a) is over all finite sets F C G. We call h(H, a) the mean 
S-entropy of a. Note that if F x C F 2 then AV(a, a : F 1; e) D ^4"P(a, a : F 2 , e). Hence if 
{F n } is any sequence of finite subsets of G with F n C -F n+1 for all n and U n F n = G then 
/i(E, a) = lim n _ +00 if (E, a : F n ). 

It is possible that AV(<Ji,a : F, e) is empty. In this case, we interpret log(O) = —00. 
Thus, it is apriori possible that h(E, a) = —00. 

In order to handle the case when a is an infinite partition, we need to review some 
standard definitions. 

Definition 11. Let (X, /i) be a probability space and let a = (Ai, A2, ■ ■ ■ ) be a measurable 
partition of X into at most countably many sets (each of which is called an atom of a). 

The entropy of a is 

00 

#(<*) = -X>(4)iog(M^))- 

i=l 

Definition 12. If a and /3 are two partitions of X then their join, denoted a V /3, is defined 
by a V /? = {A n 5 I A G a, 5 G /?}. 

Definition 13. If a, /? are partitions of X and for every A G a there exists a 5 G (3 such 
that /i(A — B) — (i.e. A is a subset of B up to a set of measure zero) then we say that a 
refines j3. Equivalently, (3 is a coarsening of a. We denote this by a > (3. A chain of a is 
a sequence {a n }™ =1 of finite partitions such that a,\ < a 2 < ■ ■ ■ < a and V^i a i = a - 
Often, we will abuse notation by writing A C B to mean fi(A — B) = 0. 

Definition 14. Let (G,X,fj) be a system and £ := {0^}°^ a map sequence for G. Let 
a = (Al, Ai-, ■ ■ ■ ) be a partition of A. For every e > and finite set F G G, let 

i/(E, a : F) = inf < lim if (E, a n : F) : {a n }^ =1 is a chain of a k 

I n— >oo J 

We will prove in section O that H(E, a : F) = lim n ^ 00 if (E, a n : F) for any chain {a n } of a. 
As in the finite case, define h(E, a) = inf^ c c ii(S, a : F). 

Remark 1. The above definitions admit two natural generalizations. First, the limsup in the 
definition of H (E, a : F,e) can be replaced with a liminf or an ultralimit. The latter is very 
natural from the perspective on sofic groups taken in [ES05] and [Pe08] . Second, each map 
Oi : G — > Sym(mj) could be random. In this case, define 

ii(E, a : F, e) = limsup — logE \AT{pi, a : F,e) 

where E[-] denotes expected value. This is used in [Bo09] to show that the /-invariant defined 
in [Bo08a] is a special case of E-entropy. All of the results in this paper remain true if these 
two generalizations are utilized (with only minor, obvious changes in the proofs). But for 
simplicity's sake, we will not make use of either generalization. 
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In order to obtain a measure- conjugacy invariant, we need to focus on a special class of 
partitions described next. 

Definition 15. Let (G, X, /x) be a system and a a partition of X. Let E Q be the smallest G- 
invariant a-algebra containing the atoms of a. Then a is generating if for any measurable 
set A C X there exists a set A' 6 E a such that fi(AAA') = 0. 

The main theorem of this paper is: 

Theorem 2.1. Let G be a countable sofic group. Let E = {a{\°? =1 be a sofic approximation 
to G. Then if (G,X, /i) is any G-system and a,/3 are any two generating partitions of X 
with H{a) + H{f3) < oo then h{E, a) = h{E, f3) . 

This motivates the following definition: 

Definition 16. Let (G,X,fi), E be as above. If (G,X,/i) has a generating partition a with 
H(a) < oo then let /i(E,G, X, /i) = /i(S,a). By the above theorem, h(T>, G, X, /i) depends 
on the system (G, X, /i) only up to measure-conjugacy. 

Remark 2. If there does not exist a generating partition a with H(a) < oo then /i(E, G, X, n) 
is undefined. This differs from the classical case in which the mean entropy of a system is 
defined as the supremum of the mean entropy rate of a over all finite partitions a. In 
general, it is possible that the supremum of /i(E, a) over all finite partitions a is infinite even 
if /i(S, G, X, /i) is finite. For example, theorem 11.31 implies that this occurs for the Bernoulli 
shift (G, K G , k g ) if G contains a nonabelian free group. 

Remark 3. It can be shown that if G is amenable and if E is any sofic approximation to G 
then h(E,G,X, n) is the classical mean entropy of (G,X,fi). Since we will not use this, we 
do not prove it. 

In section [H] we show the following. 

Proposition 2.2. Let G be a countable sofic group. Let S be a sofic approximation to G. 
Let (K, k) be a probability space with H(k) < oo. Then /i(E, G, K G , k g ) = H(k). 

This proposition and theorem 12.11 imply theorem 11.11 Theorem 11.61 follows from this 
proposition, theorem 12.11 and the next lemma. 

Lemma 2.3. Let a : G — » G be an automorphism of a countable sofic group G. Let E = {cr,;} 
be a sofic approximation and (G, X, jj) a G-system. Let (X a ,fi a ) be a copy of (X,fi). Define 
an action of G on (X a ,fi a ) by g ■ x = a(g)x. Then h(T,,G,X, /i) = /i(E a , G, X a ,fi a ) where 
E a is the sequence {ai o a}^ 1 . 

Proof. Let e > 0, F C G be finite. Let a be a finite partition of X. For i > 0, define 
AV(ai, a : F, e) as in definition [SJ 

Let a a be the partition of X a corresponding to a (i.e., a a is a copy of a in X a ). Let 
AV a (cri o a, a a : a _1 (F),e) be defined as in definition [8] - but with the system (G,X a ,/j a ) in 
place of (G, X, ji). An exercise in definition-chasing shows that AV(ai, a : F, e) = AT >a {o- i o 
a,a a : a _1 (F),e). This implies h(T,,a) = h(T, a ,a a ). Since this is true for every finite 
partition a, the lemma follows. 

□ 
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2.1 An Outline of the Paper 



To prove theorem 12.11 we first review standard definitions and results from classical entropy 
theory. This is section [31 Then, we develop a general theory for the space of partitions in 
section HI This culminates in the establishment of simple criteria under which a function 
/ from the space of partitions to lU {— oo} is guaranteed to be constant on the set of 
generating partitions. It is shown in sections [M3 that h(E, ■) satisfies these conditions. This 
finishes the proof of theorem 12.11 In section [BJ we compute the entropy of a Bernoulli shift 
over a sofic group with respect to a sofic approximation and use it to prove proposition 12. 2\ 
theorem 11.11 corollary 11.21 and theorem 11.41 



3 Classical Entropy Theory in Brief 

To prove the above results, we will need some basic facts from classical entropy theory. An 
expert could skip this section, referring back to it for notation if necessary. Fix a probability 
space (X, yu). 

Definition 17. Let T be a cr-algebra contained in the cr-algebra of all measurable subsets 
of X. Given a partition a of X, define the conditional information function I(a\!F) : 
X^Mby 

I{a\F){x) = -\og( l i{A*\F)(xj) 

where A x is the atom of a containing x. Here, if A C X is measurable then /a(A\ J 7 ) : X — ► R 
is the conditional expectation of xa, the characteristic function of A, with respect to the 
cr-algebra T . In other words, it is an jF-measurable function such that for all jF-measurable 
functions / : X — > R, 



n(A\J r )(x)f(x)dix(x)= I XA(x)f(x)djd(x). 

X J X 

The conditional entropy of a with respect to JF is defined by 

H(a\J r )= [ I(a\F){x)diJt(x). 



Jx 

For simplicity, let H(a) = H(a\{X,$}) and I{a) = I(a\{X,$}). 

If (3 is a partition then, by abuse of notation, we can identify (3 with the cr-algebra equal 
to the set of all unions of partition elements of (3. Through this identification, I(a\j3) and 
H (a |/3) are well-defined. 

Lemma 3.1. For any two partitions a,/3 and for any two a-algebras T\,Ti with T\ C T2, 

H(aV0) =H(a)+H(p\a), 

H(a\F 2 ) < #(cv|.Fi) 

with equality if and only z//i(y4|jF 2 ) = /i(y4|jF x ) a.e. for every A G a. In particular H(a\ (3) < 
H(a) and equality occurs iff a and [3 are independent (i.e., WA e a,B e /3, fj,(AC\ B) = 
li{A)n(B)). 
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Proof. This is well-known. For example, see [G103, Proposition 14.16, page 255]. □ 

4 The space of partitions 

In order to prove theorem 12.11 it is necessary to develop a general theory of the space of 
all partitions of a given probability space (X, /x) on which G acts by measure-preserving 
transformations. In the case of finite partitions and finitely generated groups, the required 
results were proven in [Bo08a]. So fix a countable group G. Let (G,X,/a) be a G-system. 

Definition 18. Two measurable partitions a, (3 of X are equivalent if for every A £ a 
there exists a B G /3 such that fi(AAB) = 0. Let V denote the set of equivalence classes of 
measurable partitions a of X with H(a) < oo. Generally speaking, we will abuse notation 
and consider elements of V as partitions themselves. This is similar to the common abuse of 
considering an element / G L p (X,fi) to be a function when it is really an equivalence class 
of functions. 

Definition 19 (Rohlin distance). Define d : V x V — ► R by 

d(a, (3) = H(a\/3) + H(j3\a) = 2H(a V p) - H(a) - H(J3). 

By [Pa69, theorem 5.22, page 62] this defines a distance function. 

G acts isometrically on V by ga = (gA±, gA 2 , . . . ) where g G G and a = (A±, A 2 , . . . ) G V . 
I.e., if g G G, a, (3 G P then d(ga, g/3) = d(a, (3). 

Definition 20. Let S C G. The left-Cayley graph T of (G, S) is defined as follows. The 
vertex set of T is G. For every s G S and every g G G there is a directed edge from g to sg 
labeled s. There are no other edges. S generates G if and only if T is connected. 

The induced subgraph of a subset F C G is the largest subgraph of V with vertex 
set F. A subset F C G is S-connected if its induced subgraph in V is connected (as an 
undirected graph). 

Definition 21. Let a and j3 be partitions. If, for every atom A G a there exists an atom 
B G j3 such that fi(A — B) = (i.e., A is contained in B up to a set of measure zero) then 
we say a refines (3. Equivalently, (3 is a coarsening of a. This is denoted by (3 < a. We 
will often abuse notation by writing A G B to mean jjl{A — B) = 0. 

Definition 22. If F C G is finite, and a eV, let 

a F = \f ga. 

Partitions a, (3 G P are 5-equivalent if there exists finite 5-connected sets F-±, F 2 C G such 
that e G F 1 H F 2 , a < j3 Fl and /5 < a Fi . If S,a, j3 are all finite and 5 generates G then this 
is equivalent to the definition of combinatorial equivalence given in [Bo08a]. 
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To check that this defines an equivalence relation, let a, (3, 7 G V and let A,B,C,D C G 
be finite ^-connected sets containing the identity element such that a < j3 A , f3 < a B , (3 < 7 C 
and 7 < j3 D . Then 

a < P A < (l°) A = 1 AC 

where AC = {ac \ a G A, c G C}. Similarly, 7 < a DB . An easy exercise shows that AC and 
DB both contain the identity and are S'-connected. Thus a is ^-equivalent to 7. This shows 
that S'-equivalence is an equivalence relation on V . 

The main result about S'-equivalence is: 

Theorem 4.1. Let S G G generate G. If a, (3 G V are generating partitions of the sys- 
tem (G,X,fi), then for every e > there exists a' G V such that a' is S-equivalent to a 
and d(a',(3) < e. I.e., the S-equivalence class of a is dense in the space of all generating 
partitions. 

We will first need some lemmas. 

Lemma 4.2. If F C G is finite then the function a 1— >■ a F is continuous on V. 

Proof. Let V F = Y\t eF V be the product space. Define : V — ► V F by 0(a) = (fa)f eF - 
Define -0 : V F — ► "P by ^((a/)/^) = V/gf a /- Then the function a 1— ► a F is the composition 
of and -0. 

For fixed g G G, the map a 1— ► (?a is continuous on V. So is continuous (where P F 
has the product topology). It is easy to see that -0 is also continuous. For example, if 
a, a', 0, P' G Pthen 

d(a V (3, a' V /3') = H(a V /3|a' V/3') + H{a' V /3'|a V/3) 

< #(a|a' V /3') + //(/31a' V /?') + /J(a'|a V (3) + H([3'\a V /3) 

< H(a\a) + #(/3|/3') + H(a'\a) + #(/3'|/3) 
= d(a,a')+d(/5,/5')- 

Similarly, if (af) feF , (Pf)feF G V F then d(\f feF a f , \/ feF Pf) < J2feF d ( a f> Pf)- This proves 
that a I— > a F is continuous. D 

Lemma 4.3. Let a £ V be a generating partition. Let (3 eP and e > 0. Let S G G generate 
G. Then there exists a finite S-connected set F G G and a partition 7 G V such that 7 < a F 
and d((3, 7) < e. 

Proof. This is an easy exercise left to the reader. □ 

Lemma 4.4. Let a G V. Then for every e > t/iere exists a 5 > suc/i #ierf z/c<j = {X^, X5} 
is a 2-atom partition of X with n(Xg) < 5 and £ is the partition 

£ = {X L } U{X s nA\Aea}, 

then H(£) < e. 
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Proof. Let e > 0. Let 5 > be small enough so that if Y C X is any set with fi(Y) < 5 and 
if uo = {X - Y, Y} then 

2H(u) + / 1(a) dfi < e 

where 1(a) is the information function of a. 

Now let oo = {Xl,Xs} with 5 > fi(Xs) and let £ be as above. Consider the information 
function f(£). Note that f(£) (x) = f(o;)(x) if x G X L and f(£) (x) = I(a V w)(x) if x 6 X 5 . 
It follows from the definition of the information function that I(a V u) = 1(a) + I(uj\a). 
Hence 



if (f ) = / /(w) d// + / f (a V uo) dfi 

J x L J x$ 

< H(lo)+ I(a) + I(uo\a) dfi 

Jx s 

< 2H(uo) + J 1(a) dfi < e. 

JXa 



We can now prove theorem 14.11 



Proof of theorem 4-1 Let 1 > e > 0. Let S > be as in the previous lemma. By choosing 5 
smaller if necessary, we may assume e~ 5 > 1/2 and e > S. 

Because S generates G, the Cayley graph T of (G, S) is connected. Since (3 is generating, 
there exists a finite S-connected set F C G such that e G F and H (a\[3 F ) < 5 2 /2. Let 82 be 
a number with S 2 /2 > 5 2 > 0. By lemma H~3l there exists a finite ^-connected set K C G 
and a partition 7 G V such that e G fT, 7 < a^ and d(j,j3) < ^. By lemma I4.2[ the 
function a 1— > a F is continuous. Hence, by choosing <5 2 smaller if necessary, we may assume 
that d(-f F ,(3 F ) <5 2 /2. So, 

if(a| 7 F ) = H (a V 7 F ) - H ( 7 F ) 

< ff(a|7 F V/5 F ) + ff(7 F V/3 F )-ff(7 F ) 

< ff(«|/? F ) +ff (/5 F |7 F ) 

< if(a|/? F ) + rf(/? F , 7 F )<5 2 . 



X L = {sGX I f(a| 7 F )(x) <5}. 



Let 

Let X 5 = X — X L . Since 

5 2 > H(a\ 1 F ) =- I f(a|7 i '')(x) d//(x) > /i(X 5 )5, 



/-t(X-j) < <*• Le t 00 = {X L , X s }, £ = {X L } U {X 5 n A \ A G a} and A = 7 V f . We will show 
that A is ^-equivalent to a and d(X, f3) < 2e. 
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Since the information function I{a\ r ) F ) is constant on each atom of a\/ , y F , Xi is a union 
of atoms of a V 7 F < (a K ) F = a FK where FK is the product FK = {fk : f E F,ke K}. 
Note that FK is S'-connected and e £ FK. So uj = {Xl,Xs} < a FK . This implies 
^<«Vw< a FK . Hence A = 7 V f < a Fi ^. 

On the other hand, a < , y F V £. To see this, let A £ a. Then 

A= (Anx L )u(Anx s ). 

Since An As £ £, it suffices to show that AnX^ is a union of atoms in 7 F V£. By definition, 
A PI Xl is the union of A fl C over all atoms C £ 7 F such that fj,(A\C) > e~ 5 > 1/2 where 
//(A|C) = ^gp. But if fi(A\C) > 1/2 and if A 2 £ a also satisfies /i(A 2 |C) > e" 5 > 1/2 
then A 2 = A. Thus AnC = X L nC. So A n X L is the union of X L n C over all atoms 
C £ 7 F such that yu(v4|C) > e^* 5 . This proves that A fl Xl is the union of atoms in 7 F V £. 
Thus, a < 7 F V £ as claimed. Since A = 7 V £, this implies a < A F . Because A < a FK , a is 
S'-equivalent to A. 

It remains to estimate d(\,/3). 

d(\,/3) = H(\\p)+H(J3\\) 

< H(jVZ\(3)+H(P\j) 

< H(S\p) + d(T(,P) 

< H{£) + S 2 /2 < e + S 2 /2. 

The last inequality comes from the definition of 5 at the beginning of this proof. Since 5 < e 
and e is arbitrary, this proves the theorem with a' = A. □ 

4.1 Splittings 

In order to apply theorem 14. 1[ we will show that if a, ft £ V are S'-equivalent then they have 
a common "S-splitting" as defined next. 

Definition 23. Let a £ V be a partition. A simple S-splitting of a is a partition o of 
the form a = a V sft where s £ S and ft is a coarsening of a. 

An 5-splitting of a is any partition a that can be obtained from a by a sequence of 
simple S'-splittings. In other words, there exist partitions cto, a%, . . . , a m such that ao = a, 
a m = a and a.- l+ i is a simple 5-splitting of «j for all < i < m. 

Lemma 4.5. If ft is an S -splitting of a £ V then a is S -equivalent to ft. 

Proof. It suffices to consider the special case in which ft is a simple ^-splitting. Then 
a < ft < qV tat = a>{ e,t } for some t £ S. Since {e, t] is S'-connected, this proves it. □ 

Lemma 4.6. Let S C G. If a, ft £ V, a refines ft and F G G is finite, S-connected and 
contains the identity element e then a V ft F is an S -splitting of a. 
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Proof. We prove this by induction on \F\. If \F\ = 1 then F = {e} and the statement is 
trivial. Let fo^F — {e} be such that F\ = F — {f } is S-connected. To see that such an f 
exists, choose a spanning tree for the induced subgraph of F. Let /o be any leaf of this tree 
that is not equal to e. 

By induction, a.\ := a V j3 Fl is an S-splitting of a. Since F is S-connected, there exists 
an element f\ G F x and an element S\ & S such that si/i = f . Since /i G F 1; _/i/3 < «i. 
Thus 

a V /? F = ai V f (3 = ai V si(/i/3) 

is an S'-splitting of a. □ 

Proposition 4.7. Lei S C G. Let a, (3 G "P fre S-equivalent. Let a,j3 be S-splittings of a,j3 
respectively. Then there exists a partition 7 G V that is an S -splitting of a and an S -splitting 
of P. 



Proof. By lemma H~5l a and (5 are S-equivalent. So it suffices to prove that there is a partition 
7 G V that is an S'-splitting of a and an S-splitting of j3. Let F, K be finite S-connected sets 
containing the identity such that a < (3 F and f3 < a K . Thus a K < (/3 F ) K = j3 KF . Since /3 
is a coarsening of a^ and KF is S-connected and contains the identity, the previous lemma 
implies 7 = [3 KF is a splitting of a^, and therefore, is a splitting of a. Of course, /3^ F is 
also a splitting of /?. □ 

The next theorem explains how we will use this general theory to prove that h(E, a) does 
not depend on the choice of generating partition a. 

Theorem 4.8. Let G be a countable group and suppose S C G is a generating set for G. Let 
{G,X,ji) be a G-system. As above, letV be the space of partitions of X with finite entropy. 
Let f : V — > RU{— 00} be an upper semi- continuous function such that f is invariant under 
S-splittings (i.e., if a is a S-splitting of [3 G V then f(a) = f(ft))- Then for all generating 
partitions a, (3 G V, /(a) = f{@)- 

Proof. Let a, (3 G V be any two generating partitions. By theorem 14.11 there exists a 
sequence {o4} of partitions converging to /3 such that each a[ is S-equivalent to a. By the 
previous proposition, a[ and a have a common S-splitting. So f{ot! j ) = f(a) for all i. Since 
/ is upper semi-continuous, 

f{a) = lim /(o4) < /(/?). 

1— >oo 

The opposite inequality, f((3) < /(a), is similar. This proves the theorem. □ 

In the next section, we prove the inequality H(T,,j3 : F) — H({3) > H(E,a : F) — H{at) 
when a > (3. This will be useful in succeeding proofs. In section |6] we show that H(E, a : F) 
is well-defined when a is infinite and that h(E, •) is upper semi-continuous. In section [3, we 
show that /i(S, •) is invariant under S-splittings. Theorem 14.81 then implies theorem 12.11 
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5 A lower bound for the entropy of a coarsening 

The proposition succeeding the two lemmas below will be used frequently in this paper. 

Lemma 5.1. Let (G,X,fi) be a system with G a countable group. Let E = {ctj} be a map 
sequence for G (i.e., a sequence of maps o~i : G — > Symijrii) with rrii — > oo as i — > 00/ Let 
a, j3 be finite partitions of X and suppose that a refines [3. Then for all finite F C G, 

H(E, 13: F)- H(J3) > H(E, a : F) - H(a). 

Proof. Let a = (Ai, A 2 , . . . , A u ) and (3 = (B\, B 2 , . . . , B v ). Let e > 0. Let a : G — ► Sym(m). 
We will obtain an upper bound on the cardinality of AT (a, a : F,e) in terms of \AV{a, (3 : 
F,e)\. 

Let b : N — > N be the map b(i) = j if Ai C Bj. Define the coarsening map $ : ACPio, a : 
F, e) ->_AV(a, (3 : F,_e) as follows. If a = (Ax, . . . , A\) then let $(a) = J3 = (B X ,...,B V ) 
where Bj = Ut-6(i)=i^- We nee d to check that dp(f3,f3) < e as claimed. 

Let ( be the uniform probability measure on {1, . . . ,m}. As in definition [HI given a 
function <p : F -> N, let 5^ = f| /eF /-B^/) and 1^ = H/eF^/)-^/)- Define A^, A 
similarly. 

Note that B^ = U^A^ where the union is over all if) : F — > N such that A^ C B$. 
Similarly, B^ = U^A^. Therefore, 

d F {PJ) = Yl |M^)-C(^)|< Yl H^)-({A 1 p)\=d F {a,a)<e. 

This shows that J3 G AT (a, f3 : F, e) as claimed. 

Let J3 = (Bi, . . . ,B V ) G AV(o~,(3 : F, e). We will bound the cardinality of the inverse 
image $ _1 (/?). 

For a vector u = (ui, v 2 , ■ ■ ■ , v u ) G N u , let A(/? : v) be the set of all a = (A\, . . . , A u ) G 
$ -1 (/3) such that \Ai\ = Vi for all 1 < i < u. A partition a G A{j3 : u) is obtained from (3 by 
subdividing each partition element Bj into sets of cardinality t>j for fo(z) = j. Therefore, 



w^)i<(ni£ii!)(n 



1 

Tfcl 1 



j=l i=l 

If a G A((3 : u) then, 



J2 HA) - C(A)| = 5Z M4) ~ ^/ m ' ^ dp ^ ") ^ e - W 



8=1 



So fj ~ fi(Ai)m. Stirling's approximation implies that there is a function 5 
(depending only on a and j3) such that if m is sufficiently large then 



\A(P:v)\<ex V ((H(a)-H(l3) + 5(e)) 



in 



and lim e ^ 5(e) = 0. 

By equation [TJ the number of vectors v G N" such that A(f3 : v) is nonempty is at most 
(3me) u . Thus, 



{«|^(/3:-!/)^0} 

< (3me) u exp ((#(«) -#(/3) + 5(e)) m 



l* -1 ^)! 



Since /3 G APfV, /3 : F,e) is arbitrary, this implies that if m is sufficiently large, 

.A7V, a:F,e) < (3me) u exp ( (H(a) - H(J3) + (J(e))m) AV(a, P:F,e) 

Since a is arbitrary, the definition of F"(£, a : F,e) yields: 

H(E,a: F, e) < H(a) - H({3) + 5(e) + H (E, : F, e). 
Now take the limit as e — ► to obtain: 

#(E, a:F)< H(a) - H(j3) + H(E, (3 : F). 
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Lemma 5.2. Lei a G V. Let {a n } be a chain of a (as in definition [731) . Lei S 6e a map 
sequence of G. Let F G G a finite set. Then lim n ^ooLf(S, a n : F) exists. Moreover, if {f3 n } 
is another chain of a with (3 n > a n for all n then 

lim H(E,/3 n : F) < lim H(E,a n : F). 

n— +oo n— >oo 

Proof. By lemma 13.11 and the previous lemma, for each m > n > 1 , 

#(E, a m : F) < tf (E, a n : F) + H(a m \a n ). 
Take the limsup as m — > oo to obtain 

lim sup H (E, a m : F) < Lf(E, a ra : F) + Lf(a|o; n ) 

■m^oo 

for every n > 1. Take the liminf as n — > oo to obtain 

limsupLf (E, a m : F) < liminf Lf(E, a n : F). 

Here we are using H(a) < oo which implies H(a\a n ) tends to zero as n — > oo. Hence the 
limit, lim n ^ 00 F'(E, a n : F), exists. 

If {fin} is another chain of a with /3 n > a n for all n then 

#(£, A, : F) < F(E,a n : F) + #(/?„K). 

Since F"(/3 n |a n ) < H(a\a n ) — >• as n — > oo, this implies that 

lim #(£,/?„ : F) < lim F(E, a n : F). 

□ 
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We can now remove the finiteness hypothesis in lemma 15.11 

Proposition 5.3. Let G be a countable group, E a map sequence of G and (G,X,n) a 
G-system. Let a, (3 G P and suppose that a refines (3. Then for all finite F C G, 

F(S, 13: F)- H((3) > H(E, a : F) - H(a). 

Proof. Let {a n }, {f3 n } be chains of a and (3 respectively. It follows from lemma [57T1 that 

if (£, (3 n :F)- H((3 n ) > if (S, a n V ft, : F) - if (a n V /?„) 

for all n. Take the limit as n — > oo to obtain 

f lim ff(E, /3 n :F))- if (£) > f lim if (E, a n V & : F)) - H (a) 

> H{E,a:F)-H{a). 

Since this is true for every chain {/?„} of /?, the result follows. D 

6 Upper semicontinuity 

Proposition 6.1. Let (G,X,fi) be a system. Let E = {o - ,:}^! &e a sequence of maps o~i : 
G — > Sym(mi) with rrii — > oo as i — > oo. TTien /or any /invite F G G the map H (E, ■ : F) : 
P^RU {— oo} is upper semi- continuous. 

Proof. Let a6? and let {,5*}°^ cPbea sequence converging to a. It suffices to show that 
lim sup j if (S,/3* : F) < H (E, a. : F). Because ff(E,/3 l : F) does not depend on the ordering 
of the atoms of j3\ we may assume, without loss of generality, that if a = (Ax, A 2 , . . . ) and 
/? = (B{,B l 2 ,...) then 

lim fxiAjAB]) = 

for all j. 

Let us now assume that (3 l = (B{, . . . , B l u ) and a = (Ai, . . . , A^) are finite partitions 
with the same number of atoms. As in definition [HJ given a function <fi : F — > N, let 

^ = D/eF fMf) and 5 = D/gf f Bi 4>uy Define 

Then aV(a,/3 J ) — * as « — > oo since (3 l — > a (see lemma H~2j) . An elementary computation 
shows for any map a : G — > Sym(m), any e > and any i > 0, 

AP(a, a : F, e + d F (a, /5 4 )) D AP(a, ft : F, e). 

Therefore if c > 1 is arbitrary, 

ff(E,a: F,cd F (a,^)) > ff(E,ft : F). 
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Take the limsup as % tends to infinity to obtain 

H(X,a : F) > lim sup ii (£,/?* : F) 

as claimed. 

Now let j3 l = (B\,B\,...) and a = (Ai, A2, . . .) be infinite partitions. It is allowed 
that some of the atoms of a and/or /3 1 are empty. So this case includes the finite case. Let 
{a n } c ^ =1 be a chain of a. For each n, there exists a finite partition n n = {Pi, n , ■ ■ ■ , Pc n ,n} of N 
such that every atom of a n is of the form Uj e p fc n A? ^ or some k- Let (3 l n be the corresponding 
coarsening of j3 l . That is, let (3 l n be the partition whose atoms are all of the form lL e p ^j 
(for 1 < k < c n ). 

Proposition 15.31 implies that 

#(£,/? : F)-H(J?\P n ) < H{E,ft : F) 
for all i,n. The previous case implies 

limsup#(E,/3* : F) < H(E,a n : F) 

i 

for all n. Thus, 

limsup/7(S,/5 i : F) < H(£, o^ : F) + limsup H(J?\/%) = H(J:,a n : F) + H(a\a n ). 

i i 

Since this is true for all n and for every chain {a n } of a, it follows that 

lim sup if (£, /? : F) < #(E, a : F) 

as claimed. D 

Proposition 6.2. Let a G V and {a n } a chain of a. Let E be a map sequence of G. Let 
F C G a finite set. Then 

H(E,a:F)= lim H(E,a n :F). 

n— >oo 

Proof. It follows from the definition that fi(E, a : F) < lim„^ 00 ii(S, a n : F). Because 
a n — > a as n — > 00, the previous proposition implies that if(E, a : F) > lim n ^ 00 ii(ZI, a n : 
F). U 

Corollary 6.3. Let (G,X,fi) be a system. Let £ be a map sequence of G. Then the map 
h(Y,, •) : V — > K U {—00} zs upper semi- continuous. 

Proof. By definition /i(£, •) = infpif (E, • : F) where the infimum is over all finite F C G. 
Since an infimum of upper semi-continuous functions is upper semi-continuous, proposition 
16. II implies this corollary. □ 
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7 Monotonicity 

In this section, we prove that H (E, a : F) is monotone decreasing under 5-splittings. We 
handle the finite partition case first. 

Proposition 7.1. Let F C G be finite and suppose e G F. If a is an F -splitting of a finite 
partition (3 G V and £ = {ai\ c *L 1 is a sequence of maps o~i : G — > Sym(mi) with rrii — » oo 
thenH(£,a:F) < H(E,p-. F). 

Proof. Intuitively, the proposition is true because any approximation to j3 on {1, ... , m} can 
be split into an approximation of a and approximately all of the approximations to a are 
obtained this way. The proof is a matter of making this intuition precise. 

Let 1/4 > e > and a : G — > Sym(m) . We will obtain an upper bound on the cardinality 
of AV(a, a:F,e) in terms of \AV(a, (3 : F, e) |. 

It suffices to consider only the special case in which a = (Ai, . . . ,A U ) is a simple in- 
spiriting of (3 — (Bi, . . . , B v ). So there exists / G F and a coarsening £ = (X l5 . . . , X w ) of /3 
such that a = /? V /£. 

Let 6, x : {1, . . . , it} — > N be the maps defined by 

Ai = Bb(i) H fX x (i). 

Define the "coarsening" map $ : AV(o~, a : F, e) — > AV(o~,/3 : F, ej as follows. For 
a = (Ai, . . . , A,) G AP(or, a : F, e), let $(a) =(3= (B h ..., B v ) where B j = {j m)=j A { . As 
in the proof of lemma [57T] dpi/3, (3) < e so (3 G A.P(cr, j3 : F,e) as claimed. 

Next, we obtain an upper bound on the cardinality of $ _1 (/?) where /? G v4P(cr, ft : F,e) 
is a fixed partition. This bound will not depend on the choice of (3 and thus, we will be able 
to use it to bound \AV(a, a : F, e)|. In order to obtain the bound, we will show that every 
partition in $ _1 (/5) is "close" to a particular partition which we construct directly from /3 
next. 

Let t : {1, . . . , v } — ► N be defined by B t C X t{i) . Define £ = (Xi, . . . , X w ) by 

i:t(i)=j 

Let a = (Ai, . . . , A u ) be defined by 

Ai = -Bfe(j) fl cr(f)X x (i). 

Let C be the uniform probability measure on {1, . . . , m}. Everything we need to know 
about a is contained in the claims below. 

Claim 1: if 5 = (D u . . . , D u ) G <$>-\P), then 



<(U 



u 

DiAAi ) < e. 
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Claim 2: if 8 is as above and z G A — A^ for some i then z G A l~l o~(f)Dj for some 
1 < j < « such that A; n /A,- = 0. 

Let us see how claim 2 implies claim 1. First note that [X=i AAA* = lX=i A — Aj. To 
see this, let z G Ui*=i AAA*. Then there is an j such that z G DjAAj. If 2 G A — Aj then 
z G ULi A — Ai- Otherwise z G Aj — Dj. Since 5 is a partition, there is a /c 7^ j such that 
z G A- But then z G A — A& since A& does not intersect Aj. Since z is arbitrary, this 
shows that U" =1 AAAj C UlLi Di — Ai. The opposite inclusion is trivial. 

Claim 2 implies that 

w w w 

c(UAA^)=c(UA-^i)<X;|A*(A i nM : ,-)-c(An<T(/)£> J -)| < d F (M) < e- 

This proves claim 1. To prove claim 2, let z G Di — A t for some 1 < % < u. By definition, 
Ai = B b(i) n a(f)X x{ i). Since $(5) = /3, z G A C A (i) . So, it must be that z ^ a(f)X x{i) . 
Let j be such that z G a(f)Dj C a(f)B^j) C o~(f)X t (b(j))- Note t(b(j)) 7^ x(i). 
By definition, 

^ n /Aj- c /Xr(i) n fx t (p(j)). 

Since t(b(j)) 7^ x(i), -^t(b(j)) H X^) is empty. So Aj n /Aj is empty. This proves claim 2. 

It follows from claim 1 that any partition in $ _1 ( / 5) can be obtained from ct by "rela- 
beling" a subset of {1, . . . , m} of cardinality at most em. That is, if S G $ _1 (/3), then there 
exists a set of cardinality n = \em\ in {1, . . . , m} such that 5 can be obtained from a by 
redefining a on this set. Since e < 1/4 this implies 

If m is sufficiently large then Stirling's approximation implies 

|$ _1 (/5)| < exp(if(2e,l-2e)m + 2emlog(u)) 

where H(x,y) = — xlog(x) — y\og(y). Since (3 G AV(a,P : F, e) is arbitrary, this implies 
that if m is sufficiently large, 

\AV{cr,a :F,e)\ <exp(H(2e,l-2e)m + 2emlog(u))\AV(<T,/3: F,e)\. 

Thus, 

H (E, a : F, e) < if (2e, 1 - 2e) + 2e log(u) + i? (E, /3 : F, e) . 

Now take the limit as e — > to obtain if (E, a : F) < if (E, /3 : F), as claimed. □ 

Next, we remove the finiteness assumption in the above proposition. 

Proposition 7.2. Let F G G be finite and suppose e G F '. If a is an F -splitting of a 
partition (3 G V and E = {o~i} c *^ 1 is a map sequence then if (E, a : F) < if (E, f3 : F). 
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Proof. It suffices to assume a is a simple F-splitting of j3. So there is an / e F and a 
coarsening £ of /3 such that a = /3 V /£. 

Let {/3 n }, {£„} be chains for /3 and £ respectively such that £ n coarsens /? n for all n. Then 
{Pn V /£«} is a chain for a. The previous proposition implies 

F(E,/? n V fZ n :F)<H(V,p n :F) 

for all n. Proposition 16.21 now implies that H(Y},/3 V /£ : F) < H(E,f3 : F) as claimed. D 

Proposition 7.3. Let S G G. Let H be a sofic approximation to G. If a is an S-splitting 
of(3eV then /i(E, a) = h(X,(3). 

Proof. We may assume that a is a simple S-splitting of (3. So there exists ant 6 S and a 
coarsening £ of (3 such that a, — {3 V t£. Note that /3 V t/3 is an S-splitting of a by lemma 
14.61 So the previous proposition implies 

hfapvtp) < h(E,a) < h(E,/3). 

So it suffices to show that h(E, (3 V t/3) > h(E, (3). 

Let us assume for now that (3 = (Bi, . . . , B v ) is finite. Let a := f3\/tj3 = (A\, A 2 , . . . , A u ). 
Let x,y : {1, . . . , u} — > N be maps determined by Aj = B x ^ fl tB y ^y We assume that the 
map z i— > (x(i),y(i)) surjects onto {1, . . . ,v} x {1, . . . ,v}. Thus some of the A^s may be 
empty. 

Let F C G be a finite set with e,t & F and let e, <5 > 0. Let a : G — > Sym(m) be an 
(F, (^-approximation to G. Let V(F) C {1, . . . , m} be the set of all elements v such that for 
all/i,/ 2 eF, 

o-(fi)o-(f 2 )v = a(fif 2 )v 

and <r(fx)v ^ <?{f2)v if f\ ^ f 2 . Since a is an (F, (^-approximation, |V(F)| > (1 — S)m. 
Define 

^ :AV(a,f3:FU Ft, e) ^ AV{a } [3V tf3 : F } e + 5\F\5) 

by #(/?) = a = (Ii, . . . , A u ) where if = (B 1? . . . , S„) then ^ = B x{i) n <r(t)S v(i) . Note 
that \l/ is injective. 

It is implicitly claimed above that dp(oj, \P(/3)) = dp (a, «) < e + 5|F|<5. Let us check this. 
As usual, for : F — > N set A^ = f] feF fA^fy Similar formulas apply to A^ and to B^, B^ 
for ip : F U Ft -> N. 

Let £ be the set of all functions : F — » N such that for every / e F with /£ e F, 
x((j)(ft)) = y((j)(f)). Observe that if £ <? and / G F is such that /t e F but x(<j>(ft)) j- 

y(0(/)) then 

A^ c ftAwt) n /v4 0(/) c ftB x{4>(ft)) n ftB ym)) = 0. 

Similarly, 

^ C a(ft)B x{m)) n a(f)a(t)B mf)) C (x( fj(x(t)\/(F) c 
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where V(F) C denotes the complement of V(F). Since ((V(F) C ) < 5 and {A^} is pairwise 
disjoint, 

d F (a,a)= J2 JM^)-C(^) < \F\8 + J2\v( A <p) ~ C( A <I>) 

On the other hand, if G Q then define : F U Ft — ► N as follows. For f £ F, let 
0(/) = x(<f)(f)) and </>(/£) = y((f)(f)). This is well-defined by the definition of £. Observe 
that 

^ = n ^(/) = n f B <<t>u)) n f tB ywf)) = n ^co = s <?- 

feF feF feFuFt 

It is almost true that A^ = B$. If a is not a homomorphism then this equality can fail. 
To be precise, observe that 

h = PI a (/)^(/) = H a U)B*<W) n *(■/>(*) A/W/))> 

feF feF 



while 



Hence 



% = n *(/)**(*/)) n ^(/*)^ : 



»(«/))■ 



/sf 



A A% C |J a(/t)\/(F) c U <x(/)a(£)\/(F) c 

where V(F) C denotes the complement of V(F). 

Each of the collections {A^ — B^^g and {B^, — A^^g is pairwise disjoint. Thus, 

J2 |^A^| < 2| |J a(ft)V(F) c U <x(/V(£)\/(F) c | < A\F\5m. 
<t>eQ feF 

This implies £^g |C(^) - C(%)| < 4|F|*. So, 

dj?(a,a) = J^ |/i(^)-C(^) 

< |F|5 + ^|/i(^)-C(^) 

< \F\5 + Y, |/^) - C(%)| + JC(^) - C(^ 

< d FuFt ((3j) + 5\F\5<e + 5\F\5. 

This proves the claim. 
Since \& is injective, 

„4P(a,/3:FUF£,e) < AP(a,P V t/3 : F, e + 5|F|<J) 
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Since S = {a{\ is a sofic approximation, each <jj is a (F, (^-approximation to G for some 
5i > with 5j — > as i — > oo. Thus if c > 1 is arbitrary then 

H(E,p-.FUFt,e) < Hfapvtp : F,ce). 

Let e — > to obtain 

H(E : P:FUFt)<H(E 7 (3Vt/3:F). (2) 

Finally, take the infimum over all finite sets F C G with e, £ G -F to obtain 

h(£,/3) < hfapvtp). 

This finishes the proof in the case that /3 is finite. 

Now suppose that (3 = (B\, . . . ) e V is a possibly infinite partition. Let {/? n } be a chain 
of /?. For every finite F C G with e,t & F, equation [2] implies 

F(E, (3 n :FUFt)< tf (E, ^ V t/5 n : F). 
Since {/3 n V t/5 n } is a chain for (3 V t/3, proposition 16.21 implies 

H(£, j3:FUFt)< H(E, pvtp-.F). 
Now take the infimum over finite sets F C G with e,t & F to obtain 

h(E,P) < h{V,PVtp). 
This finishes the proof. □ 

We can now prove theorem 12.11 

Proof of theorem \2.1[ By corollary 16.31 h(E, ■) : V — > R U {— oo} is upper semi-continuous. 
By the previous proposition, h(E, ■) is invariant under splittings. The theorem now follows 
from theorem 14.81 □ 



8 Bernoulli shifts over a sofic group 

In this section, we calculate the entropy of a product system in which one of the factors is 
Bernoulli. To be precise we need the following. 

Definition 24. Let (G, X, //) and (G, Y, v) be two systems. Then the product system 

(G, IxFj/iX!/) is defined by g(x, y) = (gx, gy). If a is a partition of X and f3 is a partition 
of Y then a x (3 := {A x B \ A e a, B e {3} is a partition oflxV. 
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Theorem 8.1. Let G be a group with sofic approximation E. Let K be a finite or countably 
infinite set and let k be a probability measure on K such that H(k) < oo. Let (3 be the 
canonical partition of K . I.e., (3 = {Bk | k G K} where B^ = {x G K G \ x(e) = k}. 

If (G, X, /i) is any G-system and a is a partition of X with H(a) < oo ; then for every 
finite F C G, 

#(£, a x 0:F)= H(E, a : F) + H(k). 

This implies: 

/i(£, G, X x K G , fj, x k g ) = /i(£, G, X, /i) + H(k) 

if (G, X, /i) has a finite- entropy generating partition. In particular, /i(£, G, K G , kF) = H(k) . 

Proof. We will first obtain the upper bound. Let r = {K G } be the trivial partition of K . 
Proposition 15.31 implies 

F(S, a x r : F) - H(a x r) > H(E, a x /? : F) - H(a x /3). 

It is a standard exercise to prove that H(a x (3) = H(a) + H{(3) = H(a) + H(n). Since 
H(a x r) = H(a), 

H{E, axf3:F)< H{E, a : F) + H{k). 

This proves the upper bound. 

Observe that, by taking coarsenings of a and j3, proposition 16.21 implies that we may 
assume, without loss of generality, that a and (3 are finite. Of course, this means that K is 
finite. So let p be the number of atoms of a and let q = \K\ be the number of atoms of (3. 

To begin, let e, 5 > 0, be such that 25 < (pg) _ ' F 'e. Let a : G — > Sym(m) be an (F, 5)- 
approximation to G. Let V(F) C {1, . . . ,m} be the set of all elements v such that for all 

/i,/ 2 eF, 

o-(fi)o-(f 2 )v = a(f 1 f 2 )v 

and a(f x )v ^ a(f 2 )v if h £ f 2 . By definition, \V(F)\ > (1 - 5)m. 

Let K = {1, . . . , q}, B { = {x_ G K G \ x(e)_ = i\, (3 = (B u ...,B q ). Identify K m with 
the set of all ordered partitions (3 = (B t , . . . , B q ) of {1, ... , m} via the map (xi, . . . , x m ) G 
K m ^ (B 1 ,..., B q ) where 

Bi = {j G {l,...,m} | Xj = i}. 

Let K m be the product measure on K m , f3 = (B~i, . . . , B q ) be a random element of K m 
with law K m and a = (Ai, . . . , A p ) G ^.^(cr, a; : F, e). We will estimate the probability that 
<iir(a x (3,a\/ J3) < 2e. Of course, di?(a x /3, a V (3) depends on a choice of ordering of the 
two partitions. But we claim that the partitions a V J3 and a x /3 can be ordered canonically 
so that 

d F (ax(3,a\/P)= J2 |/iX/t G (^x^)-C(^n^) . (3) 
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A word about the notation used above is in order. As in definition [HI if <fi '■ F — > N then 
A^ = H/eF fA(j>{f), A<t> = f] f€F a(f)A^f) and similar formulas hold for B^, B^. Note that 

n ^(^W) x B i<n) = A <t> x ^V" 

/GF 

f|/(^ (/) n^ )(/) )=^n^. 

This justifies equation [31 

In order to estimate the probability that dF(a\//3, ax j3) < 2e, fix functions 0, tp : .F — *■ N. 
For each t> G {1, . . . , m}, let Z„ equal 1 if v G V"(F) fl A^ fl .B^, and otherwise. Let 

z = Y / ™ =1 z v = \v(F)nA (t) r)B^\. 

If f ^ V"(F) fl Acf, then the expected value of Z v , denoted E[Z„], equals 0. Otherwise, 
since a{f x )v ^ a(f 2 )v for v G V{F) if /i ^ / 2 G F, it follows that E[Z„] = « G (.B^). Thus 

m 

e[|v(f) n^n s^|] = ^e[z„] = « g (b^)|v(f) n A^. 

v=l 

To estimate the variance of Z, denoted Var(Z), we will first bound K[Z V Z W ] for 1 < v, w < 
m. If either w or to is not in V(F) fl A^ then W,[Z V Z W ] = Z V Z W = 0. If (j(fi)v ^ o-{f 2 )w 
for any /i,/2 G F then Z^ and Z^ are independent, in which case E[Z V Z W ] = E[Z„]E[Z TO ]. 
So the number of non-independent pairs (Z V ,Z W ) with both v,w G V(F) fl A^, is at most 
|F(F)n^||F| 2 . So 

Var(Z) = -E[Z] 2 + E[Z 2 ] 

m 

= -E[Z} 2 + J2 ^[Z V Z W ] 

v,w=l 

< -E[Z} 2 + \V(F) f] A^\\F\ 2 + E[Z} 2 <m\F\ 2 . 
Chebyshev's inequality applied to Z/m implies that for any t > 0, 

Pr \Z/m-E[Z]/m\ >t 

Here Pr[-\ denotes "the probability that". Observe that 

Z/m = \V{F) n^n B^/m = ((v{F) n^n B f ), 

E[Z]/m = k g {B^)\V{F) n Apl/m = ({V(F) n A^)k g (B^ 
Since \V(F)\ > (1 - <5)m, it follows that 



Var(Z) |Fp 

2+2 — m £2 



mi 



(4) 



((V(F) n A^n Bip) - ((A^n B^) 



<s, 



28 



C(v(F) n a\)k g {b^) - C{A^)k g {b, 



<5. 



So equation @] implies 



Pr 



C(^n.fy)-C(40« (-fy) >t 



< 



m(i - 25) 2 



for any £ > 25. Set £ = (pq) F 'e (p is the number of atoms of a and q = \K\ is the number 
of atoms of (3). By choice of 5, t > 25. Then if m is sufficiently large, 



Pr 



C(^n^)-C(^)« G (^) >( Pq ) 



< (pq)' 



'e. 



Therefore, the probability that |C(A^n -B^,) — C(v4 ?i )fi; G '(.B^)| > (p?) ' F ' e f° r some an d ^ is 
at most e. If this event does not occur then 

<fc(a x /3, a V /?) = J^ |/iX K G (^x^)-((^n^)| 

< J2 \KAi,)K G (Bt)-C{A4)K G (Bi,)\ + (pq)-\ F \e 

< e+ ^ |M^)-C(^)| 

= e + d F (a, a) < 2e. 
So if m is sufficiently large, then 



Pr 



d F (aV p,a x (3) < 2e 



> 1-e. 



It follows from the Shannon-McMillan theorem (or, more simply, from the law of large 
numbers) that for all e > there exists an M > such that if m > M then there is a set 
Q C K m such that 

• K m (Q) > 1 - e, 

• for all q G Q, exp ( — H(K)m — em) < ft m ({<?}) < exp ( — H(K)m + em). 

Let Q be the set of all J3 G Q such that d F (a V /?, a x (3) < 2e. Then K m (Q ) > 1 - 2e 
and 

|<5o| > ^ m (Qo) exp(if(«)m - em) > (1 — 2e) exp(if («)m - em) 

for all sufficiently large m. Since this is true for every a G AV(a, a : F,e) it follows that 

\AV{a,a x (3: F,2e)\> (1 - 2e) exp(#(V)m - em)\AV(a, a : F,e)| 

for all sufficiently large m. Thus, 

#(E, a x /? : F, 2e) > #(«) - e + F(S, a : F, e). 

Let e — > to obtain H (£, a x /? : F) > -^( K ) + #(E, a : F). This provides the lower bound 
and completes the proof. D 
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Theorem 11.11 follows immediately from the above theorem, since it implies proposition [221 
that h(E, G, K G , n G ) = H(k) whenever H(k) < oo. Of course, this uses theorem 12 .![ that 
h(E, ■) is a measure-conjugacy invariant. To prove corollary II. 2| we will need the following. 

Proposition 8.2. Let G be a countable sofic Ornstein group. Let (K, k) be a standard Borel 
probability space with H(k) = +oo. Suppose there exists a generating partition a for K G 
such that H(a) < oo. Let £ be a sofic approximation to G. Then h(H, G, K G , k g ) = — oo. 

Proof. Let (L, A) be a probability space with < H(\) < oo. By the previous proposition, 

h(Z, G, K G x L G , k g x A G ) = h(L, G, K G , k g ) + H{\). 

There is a canonical measure-conjugacy between (G, K G xL G , k g x\ g ) and [G, (KxL) G , (kx 
A) G ). Because G is Ornstein and H(n x A) = +oo, this system is measurably conjugate to 
(G,K g ,k g ). Thus 



/i(£, G, K G x L G , k g x A G ) = /i(E, G, K G , 



K 



From proposition 15.31 it follows that h(E,G,K G , k g ) < H(a) < +oo. To see this, let f3 be 
the trivial partition (3 = {X, 0}. Since H(X) > this implies H(T 1 ,G,K G , n G ) = — oo as 
claimed. □ 

Corollary 1 1.21 now follows from theorem II .11 and the proposition above. To prove theorem 
11.41 we will need: 

Corollary 8.3. Let (G, X, /i), (G, Y, v) be G-systems with finite- entropy generating partitions 
a, (3 respectively. If (G, X, /x) factors onto (G, Y, v) and S is a sofic approximation to G then 

/i(E, G, Y, v) > /i(S, G, X, fj.) - H(a). 

Proof. Without loss of generality, we can assume that X = Y and v equals /i restricted to 
the a-algebra generated by /3 and the action of G. By proposition [53J for every finite F C G, 

H(E, (3:F)- H(J3) > H (E, aV/3:F)- H(a V j3). 

Since H(a V /3) < if (a) + if (/?) this implies 

F(E, (3:F)> H(E, aVP:F)- H(a). 

Take the infimum over all F C G to obtain /i(E, /3) > /i(E, a V /?) — H(a). By theorem 12. 11 
/i(S, /5) = /i(E, G, F, i/) and /i(E, aW/3) = h(E, G, X, /i). So this proves it. □ 

We can now prove theorem 11.41 
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Proof of theorem \l-4\ Let G be a countable sofic group that contains a nonabelian free group. 



Let (K, k) be a standard Borel probability space with H(k) = +00 and let (L, A) be a 
probability space with < H(\) < 00. By theorem ll.3l (G, L G , X G ) factors onto (G, K G , k g ). 
If the latter has a finite-entropy generating partition, the previous corollary implies that for 
any sofic approximation S to G, 

h(Z, G, K G , k g ) > h(Z, G, L G , X G ) - H{\) = 0. 

This contradicts the previous proposition. So (G, K G , k g ) does not admit a finite-entropy 
generating partition. □ 
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